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ABSTRACT 


Introduction: Swine Flu cases are rarely observed but are to be found subsequently increasing in many countries and also in 
India. They can be treated and diagnosed if it is detected at an early stage. However, many times it is difficult to predict as the 
symptoms are almost similar to other viral fevers. Many researchers used various algorithms and different approaches to predict 
swine flu. 

Aim and Objective: In this paper, we propose Naive Bayesian classifier for an efficient prediction, by comparing the symptoms 
and applying significant signs and prediction factors to Bayes theorem to predict the patient’s condition. 

Method: Here in this work, the machine learning algorithm is used on the available data to reduce the time and to predict the dis- 
ease perfectly. We collected 14 sample records of different swine flu suspected patients from King George Hospital, Visakhapa- 
tnam, India, based on the sample reports. We prepare the training SF (Swine Flu) dataset. The ESFP (Efficient Swine Flu 
Prediction) is first trained by using SF (Swine Flu) dataset with some of the significant symptoms observed in Swine Flu cases. 
Conclusion: The first signs and the prediction values are applied to Naive Bayesian Classifier by using the training set, and then 
the test set is applied to predict the disease prevailing or not.. 
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INTRODUCTION done. However, the Swine Flu is first observed in a few plac- 
es in Mexico and is then coined as H1N1. Later in the USA, 
the same H1N1 virus is found in people, and drastically that 
causing Swine Flu and was the foremost frequent reason be- has spread a lot, and more than 10000 persons are severely 


hind human influenza, also called u in the year 2009. HIN! affected in over forty plus countries. In 2011 the latest one is 
leaves a few strains that are prevalent in human that creates a specified to be H3N2v with Influenza HIN1°°. 


slight part of every disease- resembling sickness and a little ; ; ; ; ; 
piece of entire Influenza that are seasonal. Additional strains Machine learning plays a vital role in solving problems for 


of the specified virus of other types are also observed in birds Many solutions and also helps to predict future results in 
and pigs’. today’s world. Artificial intelligence makes use of machine 


learning techniques to solve many problems. It helps to train 
The Swine Flu is initially generated by viruses related to the algorithms by giving the sample data as input to the algo- 
Influenza that are disease-related to respiration that affect  tithms and make it to learn how to make decisions from past 
respiratory organs in pigs, leading to careless actions, nasal experiences or results and execute the tasks smartly ”. 
secretions, decreased hunger and barking cough. The same 


types of symptoms are observed in people affected by Hu- In medical diagnosis system, the main time-consuming task 
man u as in pigs affected by Swine Flu >. is to identify the disease. Late identification leads to the se- 


verity risk some times in diagnosing the condition. Machine 
learning techniques have played a significant role in the 
present generation to predict and classify the severity level 


The subclass of Influenza Virus is a virus Influenza A HIN1 


In the USA the Swine Influenza U was initially in around 
1930 seduced in pigs. A lot of investigations were being 
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in different stages. Machine Learning has predicted many 
diseases like diabetes, brain tumour and made effective di- 
agnosis plan. One of the most time-consuming conditions 
nowadays is Swine Flu. By applying the dataset to the ma- 
chine learning algorithm, we are going to predict the severity 
levels of H1N1 among the patients”®. 


Existing system 

Flu viruses are spread mostly from an individual to another 
individual using any medium either by sneezing or direct 
coughs from patients suffering from this virus. This can 
sometimes be affected even by touching the surfaces such as 
furniture or walls which are touched by the patient by which 
some people will be hospitalized. If the specified virus is not 
detected on time and eradication, then diseases like Swine 
Flu will be spread rapidly. The manual approaches are time 
taking’. 


Proposed system 

The proposed System on Artificial Intelligent System is 
used for Efficient Swine Flu Detection, which is reliable and 
competent. This requires a proper training set and test set 
for training the system. It requires less time to predict the 
stage of disease by applying the data to the machine learning 
algorithm. 


Efficient Swine Flu Detection using Naive 
Bayesian Classifier 

Many researchers used different machine learning Clas- 
sifiers to predict Swine Flu”. The entire machine learning 
algorithms has its advantages from a different perspective. 
Decision trees work better, but it requires a lot of data to get 
to build the training set. Naive Bayesian works better even 
if it doesn’t contain all the possibilities, and it also requires 
less quantity of data. This is the reason why we preferred 
naive Bayesian Classifier to predict the severity of Swine Flu 
based on 14 Swine Flu patients’ records. It requires less data 
as input and produces a fast outcome. 


Swine Flu Symptoms 

The symptoms such as headache, Joint Pains, Muscle Pains, 
Chills, running nose, dry cough and fatigue in any Influenzas 
and they are similar in Swine Flu. However few cases have 
been reporting that include Diarrhea, fever, vomiting, and 
neurological issues. 


The high risk is observed in an individual with severe com- 
plications include elderly age group, kids less than five years 
of age, children with neurodevelopment anomalies, women 
who are pregnant and such people who are prone with medi- 
cal conditions that are abnormal and underlying such as dia- 
betic patients, people who have asthma or cardiac patients 
and such people who are immune-suppressed, people suffer- 
ing from obesity in any age group’”. 


Severe case symptoms 

It is stated by WHO, that the medical image in stern case pa- 
tients is completely variant in Swine Flu compared to disease 
pattern observed in epidemics. If the people with sick medi- 
cal history are pretended and expected headed for high risk, 
numerous critical cases are also found in hale and hearty 
people with an excellent medical history. Though research 
is presently going on the influencing factors that amplify the 
extreme danger of rigorous unhealthiness don’t seem to be 
currently understood, Patients with a severe health condi- 
tion are found to be okay initially. But around three to five 
days, generally they begin to get worse after the onset of 
symptoms. Fast deterioration is being observed with failure 
in respiration in several patients within twenty-four hours. 
Immediate respiratory support and intensive care unit sup- 
port is required with mechanical ventilation in the case of 
most patients’. 


The Centres for Disease Control (CDC) recommends imme- 
diate medical attention if any experience of warning signs of 
emergency is observed in person attacked with Swine Flu’. 
In adults, they embody problem in respiratory or squatness 
of breath, pressure and pain within abdomen or chest, Confu- 
sion, fast giddiness, severe vomiting persistently and Tem- 
perature. 


In kids they include difficulty in breathing or speed breath- 
ing, skin colour turns blue, not accepting fluids, feeling, 
lazy and sleepy, non-interacting, behaves irritably that the 
children usually don’t hold, symptoms related to other u that 
recover but eventually turns to worst cough and high tem- 
perature, high temperature with a rash, not eating, having no 
tear when crying. 


Complications 

Swine Flu affected patients who are suffering from obesity 
complications are being observed and are assaulted with 
prior respiratory disease are mainly infected by pneumo- 
nia disease, This happens mostly in adults who may be di- 
agnosed under ventilation in Information and communica- 
tions technology (ICT) due to acute respiratory organ injury. 
Infections due to bacteria are more commonly observed in 
children. Significant complications include different condi- 
tions like cardiac problems (myocarditis and pericarditis), 
myositis, central nervous systems complications, toxic shock 
syndrome and secondary bacterial pneumonia. 


The prominent cause of secondary bacterial pneumonia is 
due to Staphylococcus aureus, along with Methicillin-Resist- 
ant Staphylococcus Aureus (MRSA) with a higher mortality 
rate. Cardiac complications and Neuromuscular are uncom- 
mon, but there’s an opportunity of occurrence. Fulminate 
myocarditis and pulmonary emboli were other complications 
detected’. 
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Naive Bayesian Classifier 

It is one of the most used Machine Learning algorithms for 
predicting and diagnosing diseases **. Naive Bayesian is a 
generative model and works with small data sets where Lo- 
gistic regression works with large data sets, and it also works 
faster when compared with K- nearest neighbours. Another 
main advantage of naive Bayesian is it not only outputs pre- 
diction, but it also generates a degree of certainty, which is 
very useful in diagnosing. 


Naive Bayesian Classifiers belongs to classification algo- 
rithms which are based on the Bayes theorem. It is a family 
of algorithms that share a universal principle, but the pair of 
features are independent of each other. It is a classification 
algorithm which is suitable for both binary and multiclass 
classification. It is a classification technique which super- 
vises and helps to predict the future class labels by assigning 
instances using the conditional probability. 


The Bayes’ theorem plays a vital role in dictating the collec- 
tion of algorithms related to the classification which are cat- 
egorized as Efficient Naive Bayesian Classifiers. It describes 
the probability of an event to occur based on the conditions 
that are related to the game, when applied then the hazards 
involved in the Bayes theorem have different probability in- 
terpretations, 


This is tending to be a family of procedures that do not fol- 
low the algorithmic approach but follow a common standard 
principle that is common. Therefore in each mixing of choic- 
es that are self-determining in the classification of every al- 
ternative''. 


An SF dataset is being collected, and analysis is being done, 
which is describing the symptoms of Swine Flu based on 
that go for Initial diagnosis. Proceeding with the symptoms, 
each tuple is used to classify the prevailing conditions such 
as “Yes” or “No” for Initial diagnosis. 


By applying this SF data set to Naive Bayesian Classifier 
based upon the instances, it helps to predict the condition of 
patients by verifying all the possibilities. 


As this is the Classifier algorithm, we can classify the data 
and the requirements based on the symptoms of the Swine 
Flu patients and applying it to the Bayes theorem gives Ef- 
ficient results. 


Table 1: Representing the SF dataset shows the Symp- 
toms of 14 different patients along with Affected 
Area, Immunity and Severity of Symptoms. 


S. Fever Affected Immunity Severity Initial 
of Symp- 


toms 


No Symptoms Area 





Diagnosis 
High No No 
High No No 


1  Drycough Yes 
2  Drycough Yes 


Table 1: (Continued) 


S. Fever Affected Immunity Severity Initial 
of Symp- 


toms 


No Symptoms Area 


Diagnosis 





3 Cold Yes High Yes Yes 
4  Bodypains Yes High No No 
5  Bodypains No High No No 
6  Bodypains Yes High Yes Yes 
7 Cold No Low Yes Yes 
8 Drycough No Low No No 
9 Drycough Yes High Yes Yes 
10 Bodypains No Low Yes Yes 
u Drycough No Low Yes Yes 
12 Cold No Low Yes Yes 
3 Cold No Low Yes Yes 
14 Body pains No Low Yes Yes 


The data that is collected from the hospital is categorized 
into different ways based upon the symptoms faced by the 
patients. The collected information is trained based upon the 
various attributes which are not same in all of them. All of 
those combinations of symptoms are meant for probability 
and given to the algorithm to predict the stage of Swine Flu 
(SF). 


The data represented is partitioned into two elements, where 
the first one is SF Feature Matrix, whereas next is SF Re- 
sponse Vector. SF Feature Matrix comprises of tuples of data 
during where every tuple consists of the significant rate of 
dependent SF Features. 


According to SF dataset referring to Table 1, SF Features are 
‘Fever symptoms’, ‘Effected area’, ‘Immunity’ and ‘Severity 
of symptoms’. SF Response Vector comprises the worth of 
SF class variable, which is either predicted or real expected 
result for every tuple of SF Feature Matrix related to fea- 
tures. The SF class variable name is the initial diagnoses ac- 
cording to the data considered. 


Assumptions 
Naive Bayes assumption that is fundamental is where every 
tuple of SF Feature: 


Equal outcome contribution with relation to the SF dataset, 
this idea is often considered to be in as accomplished area. 
This is Independent. This is assumed that none of the cou- 
ples of SF Features is reliant. For instance, the affected area 
be ‘Yes’ has not anything to do with the Immunity or Fever 
symptoms be ‘Dry cough’ has indeed not any effect on the 
severity of symptoms. Hence, based on the assumptions, the 
SF Features are treated to be independent 
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Next, every SF Feature will be provided equal importance 
or weight. Hence in an instance where only Effected area is 
known and Immunity alone won’t accurately forecast the re- 
sult. Not any of attribute is digressive, henceforth understood 
to be contributory equivalent to the outcome. Assumptions 
derived by the algorithm Naive Bayes may not be accurate 
generally inexact real-world scenario. The fact is that inde- 
pendent assumption works well, but it is always incorrect. 
First, the basic knowledge of Bayes’ theorem is required be- 
fore understanding the Naive Bayes. 


Bayes’ Theorem 

Bayes’ Theorem states that based on the probability of an oc- 
curring event and likelihood of another occurred event. This 
is shown mathematically with the equation', and here A and 
B are events considered !*:3: 


P(B/ A)P(A) 


P(A/B)= on 


(1) 


Mostly, we tend to try to search out 
Probability of an event A; specified that the event B is cor- 
rect. Event B is additionally termed as evidence. 


P (A) be a priority of A, which is the prior probability and is 
the event Probability earlier than evidence is observed. The 
proof is attributed worth of an indefinite object, and this can 
be event B. Hereafter evidence can be visualized the poste- 
rior probability of B is considered to be P (A|B) which is the 
probability of an event. 


Then, with deemed to SF dataset, it can apply Bayes’ theo- 
rem: 


P(X 1y)Py) 


P(Q/X)= POD (2) 


Where, X could be dependent SF Feature Vector of the size n 
and y is class variable, referring to the first tuple 


y=No 
X = (Dry cough, Yes, High, No) 


So in essence, P (X|y) at this point means, that the probability 
of “No Initial diagnosis” provided that symptoms based con- 
ditions can be “dry cough fever symptoms”, “Affected area 
is No”, “high immunity” and “no severity of symptoms”. 


Naive assumption 

Now, in this instance, it is required to place a naive assump- 
tion to the Bayes’ theorem that can be independent amongst 
the options regarding SF Features. So at this time, the evi- 
dence is split into the separate component". At this instance, 
if any two events A and B cannot be dependent, therefore it 
is defined as, 


P (A, B) =P (A)*P (B).. (3) 
Henceforth, the result can be denoted, as shown in equation 
(4). 


Pœ VIP! Y) P VPO) 
P(x P)... P(X) 





a x,) 


(4) 


This may be expressed as shown in equation (5) 


PO) [PG /¥) 


P(y/%, X,) = Pa) P(x.) or P(x,) I 





Next for any given input, being treating the denominator to 
be constant that term is removed, as shown in equation(6). 


P(y/x,_x,)%¢ PO] [PG y) (6) 


Now, it is needed to generate a model related to the classifier. 
So for this, discover the chance of a standard set of inputs for 
every attainable rate of the class variable y and acquire up 
the result with a most chance. 


The same can be shown mathematically as in equation (7) 


y =arg max, POV] | P(x,/y) (7) 
i=l 
So, finally, it tends to carry on with the task of calculating 
P(y) and P (xi | y). P(y) is referred to as class probability, and 
P (xi | y) is named conditional probability. The naive Bayes 
classifiers differ principally by the assumptions they create 
relating to the distribution of P (xi | y). 


RESULTS 


The results are reflected by a case study on Swine Flu cases 
observed by patient’s history affected by Swine Flu. The 
main symptoms are listed as follows. 


1. Fever symptoms: 
The victim is suffering from dry cough, cold, body 
pains during fever, high body temperature. 

2. Affected area: 
The victim came from a place where already people 
around the victim or the people who are residing in the 
victim’s area had been previously suffered by Swine 
Flu. 

3. Immunity: 
This is considered if the immunity power or resistance 
power towards the disease of the victim is high. 

4. The severity of symptoms: 
The victim is suffering from Breathing Difficulty, 
Chest pain, Abdomen pain, persistent vomiting, Diar- 
rhea. 
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All of the above four symptoms are used in prediction of 
Swine Flu. Now basing on the formulae specified in Sec- 
tion 2, they are applied on Assumptions SF dataset. For this, 
some pre-computation is done to SF dataset. Then for every 
xi in x and yj in y calculate the value P (xi | yj) for. The tables 
below are derived from these calculations and represent the 
expected probability values. 


Table 2: Expected Probability values w.r.to nature of 
Symptoms. 


Symptoms Nature of 





Yes No Probability Probability 


the Symp- (Yes) (No) 
toms 
Fever Dry cough 2 3 0.22 0.6 
symptoms Cold 4 o 0.44 o 
Body pains 3 2 0.33 0.4 
Affected Yes = 3 0.33 0.6 
ance No 6 2 0.67 0.4 
Immunity High 3 4 0.33 0.8 
Low 6 1 0.67 0.2 
Severity of Yes 9 o 0.64 o 
symptoms: Ng 5 o0 0.36 o 


So, as shown in the above, P (xi | yj) is calculated for each xi 
in X and yj in y manually in the tables 2 


Table 3: Probability of Initial Diagnosis 
Initial diagnosis Yes 9 0.64 
No 5 0.36 


For instance, the probability of Initial diagnosis given that 
the affected area is No, 


i.e P (Effected area = No | Initial diagnosis = Yes) = 6/9. 


Also, class probabilities (P(y)) are found that have been 
evaluated in table 3. For instance, P (Initial diagnosis = Yes) 
= 9/14. Pre-computations are prepared, and the classifier ob- 
tained can be applied. The test is done a new set of SF Fea- 
tures called Treatment. 


Treatment = (Body pains, Yes, Low, No) 
So, the probability of Initial diagnosis is denoted as 


P(body pains | Yes) P(Effected area | Yes) P(immunity / Yes) P(severity of symptoms | Yes) P(Yes) (8) 


P(Yes | treatment) = P(treatment) 





The probability of no treatment is given usually by: 





P(No treatment) = POD Pains | NoPE ffected area | No)P(immunity | No)P(severity of symptoms | No)P(Yes) (9) 


P(treatment) 


in equation (10). 


P(Yes / treatment) « —.—.—.—.— ~0.0612 (10) 
99 149 14 
And 
P(No/ treatment) x Zeke = 0.0040 (11) 
555 14 14 
Now, since 
P(No/ treatment) + P(No / treatment) =1 (12) 


However, the numbers will be transformed into probability 
by accumulating the sum is assumed to be equal to 1 accord- 
ing to normalization. 


0.0612 








P(Yes / treatment) = = 0.94 (13) 
0.0612 + 0.0040 
And 
P(No/ treatment) = SO =0.06 (4 
0.0612 + 0.0040 
Given that 


P(Yes / treatment) > P(No / treatment) 


Hence, the prediction is Initial diagnosis would be ‘Yes’. The 
method portrayed applies to the discrete dataset. However, in 
considering continuous data, Some assumptions are made re- 
garding the distribution of values of every SF Feature. Con- 
sidering different Naive Bayes classifiers, they usually differ 
by assumptions considered that they formulate concerning 
the dissemination of P (xi | y) ™. 


The run time complexity of Naive Bayesian algorithm is O(n 
K). Where n refers to no. of features and K refers to class 
labels. It can learn quickly high dimensional features with 
the limited training dataset. 


CONCLUSION 


The detection of Swine Flu is done automatically by using 
the Naive Bayesian Classifier. The complete system is ana- 
lyzed and developed. The training and test sets, which are 
the primary symptoms, are collected from a reputed hospital. 
The Artificial Intelligent System for efficient Swine Flu De- 
tection is tested and found to be genuine in detection com- 
pared to manual predictions. ESFP provides fast outcome 
with less input of data which leads to quick recognition of 
the disease and early treatment. 
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